A Unified Syntactic Model for Parsing Fluent and Disfluent Speech
نویسندگان
چکیده
This paper describes a syntactic representation for modeling speech repairs. This representation makes use of a right corner transform of syntax trees to produce a tree representation in which speech repairs require very few special syntax rules, making better use of training data. PCFGs trained on syntax trees using this model achieve high accuracy on the standard Switchboard parsing task.
منابع مشابه
Improved Syntactic Models for Parsing Speech with Repairs
This paper introduces three new syntactic models for representing speech with repairs. These models are developed to test the intuition that the erroneous parts of speech repairs (reparanda) are not generated or recognized as such while occurring, but only after they have been corrected. Thus, they are designed to minimize the differences in grammar rule applications between fluent and disfluen...
متن کاملAn improved joint model: POS tagging and dependency parsing
Dependency parsing is a way of syntactic parsing and a natural language that automatically analyzes the dependency structure of sentences, and the input for each sentence creates a dependency graph. Part-Of-Speech (POS) tagging is a prerequisite for dependency parsing. Generally, dependency parsers do the POS tagging task along with dependency parsing in a pipeline mode. Unfortunately, in pipel...
متن کاملA Syntactic Time-Series Model for Parsing Fluent and Disfluent Speech
This paper describes an incremental approach to parsing transcribed spontaneous speech containing disfluencies with a Hierarchical Hidden Markov Model (HHMM). This model makes use of the right-corner transform, which has been shown to increase non-incremental parsing accuracy on transcribed spontaneous speech (Miller and Schuler, 2008), using trees transformed in this manner to train the HHMM p...
متن کاملResearching (Non) Fluent L2 Speakers’ Oral Communication Deficiencies: A Psycholinguistic Perspective
Fluency in a second language (L2) involves a quintessentially cognitive processing system that operates quickly and effectively. The perceived importance of researching fluency through a psycholinguistic lens has motivated the related L2 research to resort to current cognitive speaking-specific models. This study, drawing on Levelt’s (1999a) psycholinguistic model, probed the deficiency sources...
متن کاملNon-Monotonic Parsing of Fluent Umm I mean Disfluent Sentences
Parsing disfluent sentences is a challenging task which involves detecting disfluencies as well as identifying the syntactic structure of the sentence. While there have been several studies recently into solely detecting disfluencies at a high performance level, there has been relatively little work into joint parsing and disfluency detection that has reached that state-ofthe-art performance in...
متن کامل